Abstract

Hadronic event shapes have been measured in proton-proton collisions at $\sqrt{s} = 7$ TeV,
with a data sample collected with the CMS detector at the LHC. The sample
corresponds to an integrated luminosity of 3.2 pb$^{-1}$. Event-shape distributions,
corrected for detector response, are compared with five models of QCD multijet
production.

Event shapes provide information about the properties of hadronic final states from particle
collisions. Suitably defined event-shape variables were among the first observables proposed
to test the theory of quantum chromodynamics (QCD) [1, 2] and have been important in
enabling progress in the theory. At $e^+e^-$ and $ep$ colliders, event shapes have played a crucial
role in the extraction of the strong coupling constant $\alpha_s$. They have been essential in tuning the
parton shower and non-perturbative components of Monte Carlo (MC) event generators and
have provided a laboratory for developing and testing analytical probes of the hadronization
process. More recently, a large set of event-shape variables suitable for pp colliders has been
proposed [3]. An important aspect of these variables is their normalization to the measured
sum of transverse momentum or energy of all the objects in the event. It is thus expected that
energy-scale uncertainties should cancel to a large extent. Event-shape variables represent a
valuable tool for early measurements of the properties of QCD multijet events at the Large
Hadron Collider (LHC) and the tuning of MC models [4].

This Letter presents the first measurement of hadronic event shapes with a data sample of 
7 TeV proton-proton collisions collected with the Compact Muon Solenoid (CMS) detector at
the LHC. The data sample corresponds to an integrated luminosity of 3.2 pb$^{-1}$.

A detailed description of the CMS experiment can be found elsewhere [5]. CMS uses a right-handed
coordinate system, with the origin located at the nominal collision point, the x-axis
pointing towards the center of the LHC ring, the y-axis pointing up (perpendicular to the LHC
plane), and the z-axis along the anticlockwise beam direction. The polar angle $\theta$ is measured
from the positive z-axis, the azimuthal angle $\phi$ is measured in the xy plane, and the
pseudorapidity is defined as $\eta = -\ln[\tan(\theta/2)]$. The central feature of the CMS apparatus is a
superconducting solenoid, of 6 m internal diameter, providing an axial field of 3.8 T. Within the field
volume are the silicon pixel and strip tracker, the crystal electromagnetic calorimeter (ECAL),
and the brass/scintillator hadron calorimeter (HCAL). Muons are measured in gas-ionization
detectors embedded in the steel return yoke. In the region $|\eta| < 1.74$, the HCAL cells have
widths of 0.087 in pseudorapidity and 0.087 rad in azimuth ($\phi$). In the ($\eta$, $\phi$) plane, and for
$|\eta| < 1.48$, the HCAL cells map onto $5 \times 5$ ECAL crystal arrays to form calorimeter towers
projecting radially outwards from close to the nominal interaction point. At larger values of
$|\eta|$, the size of the towers increases and the matching ECAL arrays contain fewer crystals. A
preshower detector consisting of two planes of silicon sensors interleaved with lead is located
in front of the ECAL at $|\eta| > 1.479$. In addition to the barrel and endcap detectors, CMS has
extensive forward calorimetry covering the region $3.0 < |\eta| < 5.0$.
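
As a quick numerical illustration of the pseudorapidity definition quoted above, the following minimal Python sketch converts between the polar angle and the pseudorapidity. The function names are illustrative choices and are not taken from any CMS software.

```python
import math


def pseudorapidity(theta: float) -> float:
    """Return eta = -ln[tan(theta/2)] for a polar angle theta in radians."""
    if not 0.0 < theta < math.pi:
        raise ValueError("theta must lie strictly between 0 and pi")
    return -math.log(math.tan(theta / 2.0))


def polar_angle(eta: float) -> float:
    """Invert the definition: theta = 2 * atan(exp(-eta))."""
    return 2.0 * math.atan(math.exp(-eta))
```

A direction perpendicular to the beam (polar angle of 90 degrees) has a pseudorapidity of zero, and the sign of the pseudorapidity follows the sign of the z coordinate.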

Two event-shape variables have been studied: the central transverse thrust $\tau_{\perp,C}$ and the central
thrust minor $T_{m,C}$. The two variables probe different QCD radiative processes and are mostly
sensitive to the modeling of two- and three-jet topologies. The term central (C) indicates that
the inputs to the calculation of these quantities are jets in the central region of the detector
($|\eta| < 1.3$), where sub-leading contributions in the calculation of the event-shape variables are
less significant, and systematic uncertainties on the jet reconstruction are smaller.

The central transverse thrust is defined as [3]

$$\tau_{\perp,C} \equiv 1 - \max_{\hat{n}_T} \frac{\sum_i |\vec{p}_{\perp,i} \cdot \hat{n}_T|}{\sum_i p_{\perp,i}},$$

where $p_{\perp,i}$ is the transverse momentum of selected jet $i$. The axis $\hat{n}_T$ which maximizes the sum,
and thus minimizes $\tau_{\perp,C}$, is called the thrust axis $\hat{n}_{T,C}$. The central transverse thrust is a measure
of the momentum in the plane defined by $\hat{n}_{T,C}$ and the beam axis. The central thrust minor is a
measure of the momentum out of this plane and is defined as

$$T_{m,C} \equiv \frac{\sum_i |\vec{p}_{\perp,i} \times \hat{n}_{T,C}|}{\sum_i p_{\perp,i}}.$$

Two-jet events that are well balanced have low values of these two variables, while isotropic 
multijet events have high values. 
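
The two definitions above can be evaluated with a short Python sketch. It is an illustration under stated assumptions, not the analysis code: jets are given as hypothetical `(px, py)` pairs of transverse-momentum components, and the thrust axis is found by brute force, using the fact that the maximum over unit vectors of the sum of |p·n| equals the largest norm of the signed vector sums of the jet momenta. The cost grows exponentially with the number of jets, which is acceptable only for the handful of jets in a typical event.

```python
import itertools
import math


def central_event_shapes(jets):
    """Return (tau, t_minor, axis) for a list of (px, py) jet momenta."""
    if len(jets) < 2:
        raise ValueError("at least two jets are required")
    sum_pt = sum(math.hypot(px, py) for px, py in jets)
    if sum_pt <= 0.0:
        raise ValueError("jets must carry non-zero transverse momentum")

    best_norm, best_axis = 0.0, (1.0, 0.0)
    # The sign of the first jet is fixed to +1: flipping all signs at once
    # only reverses the axis and leaves both event shapes unchanged.
    for signs in itertools.product((1.0, -1.0), repeat=len(jets) - 1):
        sx, sy = jets[0]
        for s, (px, py) in zip(signs, jets[1:]):
            sx += s * px
            sy += s * py
        norm = math.hypot(sx, sy)
        if norm > best_norm:
            best_norm, best_axis = norm, (sx / norm, sy / norm)

    nx, ny = best_axis
    tau = 1.0 - best_norm / sum_pt
    # In two dimensions the cross product reduces to a single component.
    t_minor = sum(abs(px * ny - py * nx) for px, py in jets) / sum_pt
    return tau, t_minor, best_axis
```

For a perfectly balanced back-to-back dijet event both quantities vanish, while four equal jets at right angles to each other give a transverse thrust of 1 - 1/sqrt(2) and a thrust minor of 1/sqrt(2).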

The transverse momenta of jets are used as input to the event-shape calculation. Jets are
reconstructed using individual particles that have been identified, and whose energies have been
measured, using a particle flow technique [6], which combines information from all
subdetectors: charged tracks in the tracker and energy deposits in the electromagnetic and hadronic
calorimeters, as well as signals in the preshower detector and the muon system. The energy
calibration is performed separately for each particle type. As a result, the input to the jet clustering
is almost fully calibrated and the resulting jets require only a small energy correction (below
10% in the central region). Jet clustering is performed using the anti-$k_T$ clustering algorithm
with a distance parameter $R = 0.5$.

Five MC generators are used to produce simulated samples for comparison with the data; the 
specifics of each generator are detailed below. In addition to the generator-level samples, we 
use "full-simulation" samples, where the events produced at the generator level are processed 
with a simulation of the CMS detector response based on Geant4 [8]. As event-shape
distributions are sensitive to QCD radiation, they are primarily affected by the description of the
parton showering and the hadronization process, and, to a lesser extent, by the description of 
multiparton interactions, which is included in all generators used. 

The first generator considered is PYTHIA 6.4.22 (PYTHIA6) [9] with tune D6T [10]. In this
version of PYTHIA, parton showers are ordered by mass. The second generator is PYTHIA 8.145
(PYTHIA8) [11] with tune 2C [12]. In this version of PYTHIA, parton showers are ordered by
$p_T$. The underlying event model is based on the multiple-parton interaction model of PYTHIA6,
interleaved with initial- and final-state radiation. The third generator is HERWIG++ 2.4.2 [13],
used with the tune of the older version 2.3. The parton showering in HERWIG++ is based on the
coherent-branching algorithm, with angular ordering of the showers. The underlying event
is simulated using an eikonal multiple parton-parton scattering model. The fourth is
MADGRAPH 4.4.24 [14] in conjunction with PYTHIA6, with tune D6T. Events containing from two to
four jets matched to partons with $p_T$ above 20 GeV/c are produced with MADGRAPH using a
matrix element (ME) calculation and subsequently passed to PYTHIA to generate parton
showers (PS). The MLM matching procedure [15] is used to avoid double counting between the ME
and PS calculations. For the matching, the minimum jet $p_T$ threshold is set to 30 GeV/c. Finally,
the ALPGEN 2.13 [16] generator is used in a similar way to MADGRAPH. ALPGEN samples are
produced separately for each jet multiplicity from two to six jets, matched to partons with $p_T$
above 20 GeV/c, and are weighted according to their theoretical cross section. Events produced
with ALPGEN using the ME calculation are passed to PYTHIA, and the MLM matching
procedure is used to avoid double counting. For the matching of ME partons to jets, the lower jet $p_T$
threshold is set to 20 GeV/c and the maximum distance between partons and jets is kept to its
default value of $\Delta R = 0.7$.
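
The geometric part of matching partons to jets can be illustrated with a short Python sketch. It is a simplified stand-in and not the full MLM procedure of Ref. [15]: it only pairs each parton with the closest free jet in the ($\eta$, $\phi$) plane. The tuple layout `(pt, eta, phi)` and the function names are assumptions made for this example; the default thresholds are the values quoted above for the ALPGEN samples.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Distance in the (eta, phi) plane, with the azimuthal difference wrapped."""
    dphi = (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, dphi)


def match_partons_to_jets(partons, jets, max_dr=0.7, min_jet_pt=20.0):
    """Greedily match partons to jets; both are lists of (pt, eta, phi).

    Returns {parton_index: jet_index or None}. Partons are treated in order
    of decreasing pt, and each jet can be matched at most once.
    """
    free = {j for j, (pt, _, _) in enumerate(jets) if pt > min_jet_pt}
    order = sorted(range(len(partons)), key=lambda i: partons[i][0], reverse=True)
    matches = {}
    for i in order:
        _, eta, phi = partons[i]
        best, best_dr = None, max_dr
        for j in sorted(free):
            dr = delta_r(eta, phi, jets[j][1], jets[j][2])
            if dr < best_dr:
                best, best_dr = j, dr
        matches[i] = best
        if best is not None:
            free.discard(best)
    return matches
```

In a real matching scheme, an event in which a parton is left without a jet, or in which additional jets appear, would typically be rejected; that decision logic is deliberately left out here.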

The data were collected between April and August 2010. Noncollision background is removed
by applying quality cuts that ensure the presence of a well-reconstructed primary vertex [17].
The selected data sample is then divided into three bins defined by $p_{T,1}$, the $p_T$ of the leading
jet (the jet reconstructed offline with the highest $p_T$). The low-$p_T$ bin contains events with
$90 < p_{T,1} < 125$ GeV/c, the medium-$p_T$ bin with $125 < p_{T,1} < 200$ GeV/c, and the high-$p_T$ bin with
$p_{T,1} > 200$ GeV/c. Data in all bins are selected using single-jet triggers that require an online
reconstructed jet with $p_T$ greater than 30 GeV/c for the low-$p_T$ bin, and greater than 50 GeV/c
for the medium-$p_T$ and high-$p_T$ bins. The trigger with a 30 GeV/c threshold was prescaled as
the instantaneous luminosity of the LHC increased; the effective luminosity in the low-$p_T$ bin is
only 0.32 pb$^{-1}$. The trigger efficiency, measured from a sample acquired with lower-threshold
triggers, is greater than 99% for all $p_T$ bins.

Quality cuts are imposed on the jets in order to remove spurious jets caused by calorimeter 
noise or other remaining noncollision background. Jets must consist of at least two particles, 
including at least one charged hadron, and not more than 99% of the jet energy may be carried 
by neutral hadrons alone, by photons alone, or by electrons alone. All jets within the detector 
acceptance ($|\eta| < 5$), with $p_T > 30$ GeV/c, and passing the quality criteria are subsequently
considered. However, if one of the two leading jets does not pass the quality cuts, the event is 
rejected. This requirement rejects less than 1% of the events. After the initial jet selection, the 
two leading jets are required to be within $|\eta| < 1.3$, or the event is rejected. All selected jets
within $|\eta| < 1.3$ are used in the event-shape calculation.
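
The leading-jet binning and the jet and event selection described above can be summarized in a minimal Python sketch. The `Jet` record and all function names are hypothetical. Two details are assumptions not fixed by the text: events lying exactly on a bin boundary are assigned to the higher bin, and the two leading jets are taken among jets within acceptance before the quality criteria are applied.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Jet:
    pt: float                   # transverse momentum in GeV/c
    eta: float                  # pseudorapidity
    n_particles: int            # number of constituents
    n_charged_hadrons: int
    neutral_hadron_frac: float  # fractions of the jet energy
    photon_frac: float
    electron_frac: float


def passes_quality(jet: Jet) -> bool:
    """At least two particles, one charged hadron, no single type above 99%."""
    fractions = (jet.neutral_hadron_frac, jet.photon_frac, jet.electron_frac)
    return jet.n_particles >= 2 and jet.n_charged_hadrons >= 1 and max(fractions) <= 0.99


def pt_bin(leading_pt: float) -> Optional[str]:
    """Assign the leading-jet pT bin (boundary handling is an assumption)."""
    if 90.0 <= leading_pt < 125.0:
        return "low"
    if 125.0 <= leading_pt < 200.0:
        return "medium"
    if leading_pt >= 200.0:
        return "high"
    return None


def select_event(jets: List[Jet]) -> Optional[Tuple[str, List[Jet]]]:
    """Return (bin label, central jets for the event shapes) or None if rejected."""
    candidates = sorted(
        (j for j in jets if abs(j.eta) < 5.0 and j.pt > 30.0),
        key=lambda j: j.pt,
        reverse=True,
    )
    if len(candidates) < 2:
        return None
    # Both leading jets must pass the quality cuts and be central.
    if not all(passes_quality(j) and abs(j.eta) < 1.3 for j in candidates[:2]):
        return None
    label = pt_bin(candidates[0].pt)
    if label is None:
        return None
    central = [j for j in candidates if passes_quality(j) and abs(j.eta) < 1.3]
    return label, central
```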

In the low-, medium-, and high-$p_T$ data samples, this selection retains 62 000, 180 000, and
23 000 events, respectively, of which 77%, 65%, and 52% are events with exactly two selected jets.

The event-shape distributions are distorted by the energy and angular resolutions of the detec- 
tor. The measured distributions are unfolded to allow comparison with the event-shape dis- 
tributions calculated in the generator-level samples. We use a regularized unfolding method 
based on singular-value decomposition (SVD) of the response matrix [18]. The inputs to the
unfolding algorithm are the distributions measured in data and the response matrix, determined
from the event-shape distributions of the PYTHIA6 generator-level and full-simulation samples.
The algorithm returns the unfolded data distribution, together with its correlation and error 
matrices. All of the unfolding corrections are below 5%. 

The dominant systematic uncertainty in the event-shape distributions comes from the jet en- 
ergy scale. While the event-shape definitions are expected to be invariant under a shift in jet 
energy scale, the uncertainty on the energy scale modifies the number of jets passing the $p_T$
threshold, thus affecting the event shapes. In order to estimate the resulting uncertainty on 
the event-shape distributions, a shift of the jet energy scale is applied to all jets entering the 
calculation, based on an uncertainty estimate which varies between 3 and 5%, depending on 
$\eta$ and $p_T$ [19]. The maximum bin-by-bin difference between the original distributions and the
two shifted distributions, after unfolding, is of the order of 4% and is assigned as a system- 
atic uncertainty. The effects of the angular resolution and the uncertainty on the jet energy are 
estimated in a similar way and are found to be insignificant. 
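
The way a shift in jet energy scale feeds into the event shapes through the jet threshold, and the way the bin-by-bin uncertainty is then taken, can be sketched in Python as follows. This is a simplified illustration: a single flat relative uncertainty is assumed, whereas the estimate of Ref. [19] depends on $\eta$ and $p_T$, and the unfolding step is omitted. The function names and the list-based histograms are assumptions of the example.

```python
def shift_jets(jet_pts, rel_unc, direction, pt_min=30.0):
    """Scale all jet pT values up (+1) or down (-1) and re-apply the threshold."""
    scale = 1.0 + direction * rel_unc
    return [pt * scale for pt in jet_pts if pt * scale > pt_min]


def max_binwise_shift(nominal, shifted_up, shifted_down):
    """Largest relative deviation from the nominal distribution in each bin."""
    if not len(nominal) == len(shifted_up) == len(shifted_down):
        raise ValueError("all distributions must have the same binning")
    result = []
    for n, up, down in zip(nominal, shifted_up, shifted_down):
        if n > 0:
            result.append(max(abs(up - n), abs(down - n)) / n)
        else:
            result.append(0.0)  # empty nominal bin: no relative shift defined
    return result
```

For example, with jets of 29, 31, and 100 GeV/c and a 5% uncertainty, the upward shift keeps all three jets above the 30 GeV/c threshold, while the downward shift keeps only one, so the set of jets entering the event-shape calculation changes even though the shapes themselves are ratios.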

Energy resolution studies have revealed up to a 10% difference between the data and the MC 
simulation [19], from which the response matrix is estimated. In order to estimate the effect
of this difference on the event-shape distributions, a new response matrix is determined from 
a MC sample in which the jet energies are smeared by an additional 10%. The differences 
between the original event-shape and the unfolded event-shape distributions obtained with 
the new response matrix are found to be below 1%. 

The robustness of the result against possible bias due to the unfolding was also tested. We 
checked that the differences between the unfolded data and each generator-level sample were 
comparable with the differences found between data before unfolding and the corresponding 
full-simulation sample. Here, the agreement between event-shape distributions in data and 
simulation was quantified by a $\chi^2$ statistic. We also ranked the simulated samples by their
increasing agreement with the data, and checked that the ranking was the same before and after
unfolding. Alternative unfolding methods [20] were applied; the resulting event-shape
distributions differed by less than 1% from the results of the SVD unfolding. We also determined
the response matrix using MADGRAPH MC samples instead of PYTHIA6 samples and found no 
significant differences. 
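
The ranking check described above can be sketched in a few lines of Python. The exact form of the $\chi^2$ statistic is not given in the text, so the sketch assumes the simplest uncorrelated form, the sum over bins of the squared difference divided by the squared uncertainty; bin-to-bin correlations from the unfolding are ignored. All names are illustrative.

```python
def chi2(data, errors, prediction):
    """Uncorrelated chi-square between a measured and a predicted distribution."""
    if not len(data) == len(errors) == len(prediction):
        raise ValueError("all inputs must have the same binning")
    return sum(((d - p) / e) ** 2 for d, e, p in zip(data, errors, prediction))


def rank_by_increasing_agreement(data, errors, predictions):
    """Order generator names from worst to best agreement (largest chi2 first).

    predictions maps a generator name to its list of predicted bin contents.
    """
    return sorted(
        predictions,
        key=lambda name: chi2(data, errors, predictions[name]),
        reverse=True,
    )
```

Applying the same function once to the detector-level distributions and once to the unfolded ones, and comparing the two orderings, reproduces the logic of the consistency check.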

The increase in the instantaneous luminosity of the LHC was accompanied by an increase in the 
average number of interactions per bunch crossing. The effect on the event-shape distributions 
was checked by measuring these distributions separately for events with exactly one, two, and 
more than two primary vertices. The resulting three sets of distributions were found to agree 
within statistical errors. 

Finally, we constructed event-shape distributions with an alternative jet reconstruction, based 
exclusively on information from the tracking detectors [21], and found that they agree within
errors with the distributions based on particle flow reconstruction. 

The unfolded event-shape distributions from data and from the PYTHIA6, PYTHIA8, HERWIG++,
MADGRAPH, and ALPGEN MC generators are shown in Figs. 1-3 for the low-, medium-, and
high-$p_T$ samples, respectively. The error bars represent the statistical uncertainties on the data
and the shaded (blue) bands represent the quadratic sum of the statistical and systematic
uncertainties discussed above. The ratios between data and MC simulations are shown in the
lower plots of Figs. 1-3 for each of the five MC generators.
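
For readers reproducing the plots, the two per-bin quantities mentioned above, the quadratic sum of uncertainties and the data-to-simulation ratio, amount to the following minimal Python sketch. The function names and the list-based inputs are assumptions of this example.

```python
import math


def total_uncertainty(stat, syst):
    """Per-bin quadratic sum of statistical and systematic uncertainties."""
    return [math.hypot(a, b) for a, b in zip(stat, syst)]


def data_over_mc(data, mc):
    """Per-bin ratio of data to simulation; NaN where the simulation is empty."""
    return [d / m if m != 0 else float("nan") for d, m in zip(data, mc)]
```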

The PYTHIA6 and HERWIG++ predictions agree with the measurements in all three momentum 
bins, while the ALPGEN and MADGRAPH curves deviate from the data as a result of an
overestimate of the fraction of back-to-back dijet events, which enter the lower tail of the distributions.
The PYTHIA8 predictions agree with the measurements in all bins of the event-shape variables 
except in the highest bin, where an underestimate is observed. This disagreement, however, 
affects only a very small region of the parameter space (0.5% of all events). 

Further studies indicate that, while the momentum of the first leading jet agrees well between 
data and MC generators, that of the second leading jet is higher in MADGRAPH than in the data.
This results in differences in the distribution of $\Delta\phi$ between the two leading jets, which then
lead to a shift of the event-shape distributions towards lower values [22]. We conclude that the
regime at high jet momentum and large number of jets, where the explicit ME calculations of
ALPGEN and MADGRAPH significantly improve on the pure PS treatment, has not been reached.

In conclusion, we have presented the first measurement of two event-shape variables, the
central transverse thrust and the central thrust minor, using a data sample of proton-proton
collisions at a center-of-mass energy of 7 TeV, accumulated by the CMS detector at the LHC. The
measured event-shape distributions are presented after correction for the detector response. 
We compare them with predictions from the PYTHIA6, PYTHIA8, HERWIG++, MADGRAPH, and 
ALPGEN MC generators. The event-shape distributions from PYTHIA6, PYTHIA8, and HER- 
WIG++ show satisfactory agreement with the data, while discrepancies are found between the 
data and predictions from ALPGEN and MADGRAPH. These measurements provide input for 
the improvement of currently available models of QCD multijet production. 

We wish to congratulate our colleagues in the CERN accelerator departments for the excellent 
performance of the LHC machine. We thank the technical and administrative staff at CERN and 
other CMS institutes, and acknowledge support from: FMSR (Austria); FNRS and FWO (Bel- 
gium); CNPq, CAPES, FAPERJ, and FAPESP (Brazil); MES (Bulgaria); CERN; CAS, MoST, and 
NSFC (China); COLCIENCIAS (Colombia); MSES (Croatia); RPF (Cyprus); Academy of Sci- 
ences and NICPB (Estonia); Academy of Finland, ME, and HIP (Finland); CEA and CNRS/IN2P3 

Figure 1: Distributions of the logarithm of the central transverse thrust (top left) and central
thrust minor (top right) for events with a leading jet $p_T$ between 90 and 125 GeV/c, from data
and from five MC simulations. The error bars on the data points represent the statistical
uncertainty on the data, and the shaded (blue) bands represent the sum of statistical and systematic
errors. The lower plots show the ratio between data and the different simulated samples.

Figure 2: Distributions of the logarithm of the central transverse thrust (top left) and central
thrust minor (top right) for events with a leading jet $p_T$ between 125 and 200 GeV/c, from data
and from five MC simulations. The error bars on the data points represent the statistical
uncertainty on the data, and the shaded (blue) bands represent the sum of statistical and systematic
errors. The lower plots show the ratio between data and the different simulated samples.

Figure 3: Distributions of the logarithm of the central transverse thrust (top left) and central
thrust minor (top right) for events with a leading jet $p_{T,1} > 200$ GeV/c, from data and from five
MC simulations. The error bars on the data points represent the statistical uncertainty on the
data, and the shaded (blue) bands represent the sum of statistical and systematic errors. The
lower plots show the ratio between data and the different simulated samples.